Overview
Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 932 |
| Missing cells | 50 |
| Missing cells (%) | 0.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 65.7 KiB |
| Average record size in memory | 72.1 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 2 |
Builtup_area is highly overall correlated with Carpet_area | High correlation |
Carpet_area is highly overall correlated with Builtup_area | High correlation |
Hospital_dist is highly overall correlated with Market_dist and 1 other fields | High correlation |
Market_dist is highly overall correlated with Hospital_dist | High correlation |
Taxi_dist is highly overall correlated with Hospital_dist | High correlation |
Taxi_dist has 13 (1.4%) missing values | Missing |
Market_dist has 13 (1.4%) missing values | Missing |
Builtup_area has 15 (1.6%) missing values | Missing |
Carpet_area is highly skewed (γ1 = 25.95686468) | Skewed |
Price_house is highly skewed (γ1 = 25.27467092) | Skewed |
Reproduction
| Analysis started | 2026-02-22 15:50:20.852996 |
|---|---|
| Analysis finished | 2026-02-22 15:50:28.078152 |
| Duration | 7.23 seconds |
| Software version | ydata-profiling vv4.18.1 |
| Download configuration | config.json |
Variables
Taxi_dist
Real number (ℝ)
High correlation Missing
| Distinct | 884 |
|---|---|
| Distinct (%) | 96.2% |
| Missing | 13 |
| Missing (%) | 1.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8229.728 |
| Minimum | 146 |
|---|---|
| Maximum | 20662 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 146 |
|---|---|
| 5-th percentile | 4111.8 |
| Q1 | 6476 |
| median | 8230 |
| Q3 | 9937 |
| 95-th percentile | 12386.3 |
| Maximum | 20662 |
| Range | 20516 |
| Interquartile range (IQR) | 3461 |
Descriptive statistics
| Standard deviation | 2561.985 |
|---|---|
| Coefficient of variation (CV) | 0.31130859 |
| Kurtosis | 0.4425674 |
| Mean | 8229.728 |
| Median Absolute Deviation (MAD) | 1741 |
| Skewness | 0.14280014 |
| Sum | 7563120 |
| Variance | 6563767.2 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 7214 | 2 | 0.2% |
| 7739 | 2 | 0.2% |
| 12794 | 2 | 0.2% |
| 8510 | 2 | 0.2% |
| 4846 | 2 | 0.2% |
| 10662 | 2 | 0.2% |
| 8475 | 2 | 0.2% |
| 9154 | 2 | 0.2% |
| 8085 | 2 | 0.2% |
| 4917 | 2 | 0.2% |
| Other values (874) | 899 | |
| (Missing) | 13 | 1.4% |
| Value | Count | Frequency (%) |
| 146 | 1 | |
| 604 | 1 | |
| 1200 | 1 | |
| 1241 | 1 | |
| 1637 | 1 | |
| 1648 | 1 | |
| 1868 | 1 | |
| 2017 | 1 | |
| 2222 | 1 | |
| 2314 | 1 |
| Value | Count | Frequency (%) |
| 20662 | 1 | |
| 16850 | 1 | |
| 16233 | 1 | |
| 15522 | 1 | |
| 15321 | 1 | |
| 15082 | 1 | |
| 14637 | 1 | |
| 14470 | 1 | |
| 14306 | 1 | |
| 14005 | 1 |
Market_dist
Real number (ℝ)
High correlation Missing
| Distinct | 866 |
|---|---|
| Distinct (%) | 94.2% |
| Missing | 13 |
| Missing (%) | 1.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11018.753 |
| Minimum | 1666 |
|---|---|
| Maximum | 20945 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 1666 |
|---|---|
| 5-th percentile | 6713.3 |
| Q1 | 9354.5 |
| median | 11161 |
| Q3 | 12670.5 |
| 95-th percentile | 14999.9 |
| Maximum | 20945 |
| Range | 19279 |
| Interquartile range (IQR) | 3316 |
Descriptive statistics
| Standard deviation | 2543.9206 |
|---|---|
| Coefficient of variation (CV) | 0.23087191 |
| Kurtosis | 0.050138569 |
| Mean | 11018.753 |
| Median Absolute Deviation (MAD) | 1687 |
| Skewness | -0.037130815 |
| Sum | 10126234 |
| Variance | 6471532 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11465 | 3 | 0.3% |
| 11673 | 2 | 0.2% |
| 15127 | 2 | 0.2% |
| 13333 | 2 | 0.2% |
| 10781 | 2 | 0.2% |
| 11562 | 2 | 0.2% |
| 11622 | 2 | 0.2% |
| 12656 | 2 | 0.2% |
| 10134 | 2 | 0.2% |
| 11589 | 2 | 0.2% |
| Other values (856) | 898 | |
| (Missing) | 13 | 1.4% |
| Value | Count | Frequency (%) |
| 1666 | 1 | |
| 4402 | 1 | |
| 4574 | 1 | |
| 4644 | 1 | |
| 4950 | 1 | |
| 5134 | 1 | |
| 5142 | 1 | |
| 5177 | 1 | |
| 5250 | 1 | |
| 5276 | 1 |
| Value | Count | Frequency (%) |
| 20945 | 1 | |
| 18281 | 1 | |
| 17958 | 1 | |
| 17552 | 1 | |
| 17541 | 1 | |
| 17486 | 1 | |
| 17227 | 1 | |
| 17111 | 1 | |
| 17101 | 1 | |
| 17040 | 1 |
Hospital_dist
Real number (ℝ)
High correlation
| Distinct | 895 |
|---|---|
| Distinct (%) | 96.1% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13072.092 |
| Minimum | 3227 |
|---|---|
| Maximum | 23294 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 3227 |
|---|---|
| 5-th percentile | 8731.5 |
| Q1 | 11301.5 |
| median | 13163 |
| Q3 | 14817 |
| 95-th percentile | 17048 |
| Maximum | 23294 |
| Range | 20067 |
| Interquartile range (IQR) | 3515.5 |
Descriptive statistics
| Standard deviation | 2586.4562 |
|---|---|
| Coefficient of variation (CV) | 0.19786092 |
| Kurtosis | 0.27178913 |
| Mean | 13072.092 |
| Median Absolute Deviation (MAD) | 1742 |
| Skewness | -0.060977825 |
| Sum | 12170118 |
| Variance | 6689755.5 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 15491 | 2 | 0.2% |
| 12694 | 2 | 0.2% |
| 11170 | 2 | 0.2% |
| 14479 | 2 | 0.2% |
| 15845 | 2 | 0.2% |
| 13016 | 2 | 0.2% |
| 15721 | 2 | 0.2% |
| 14405 | 2 | 0.2% |
| 12454 | 2 | 0.2% |
| 12267 | 2 | 0.2% |
| Other values (885) | 911 |
| Value | Count | Frequency (%) |
| 3227 | 1 | |
| 4922 | 1 | |
| 5446 | 1 | |
| 5913 | 1 | |
| 6316 | 1 | |
| 6422 | 1 | |
| 6583 | 1 | |
| 6764 | 1 | |
| 6808 | 1 | |
| 7251 | 1 |
| Value | Count | Frequency (%) |
| 23294 | 1 | |
| 22407 | 1 | |
| 20263 | 1 | |
| 20076 | 1 | |
| 20046 | 1 | |
| 19617 | 1 | |
| 19497 | 1 | |
| 19046 | 1 | |
| 19014 | 1 | |
| 18836 | 1 |
Carpet_area
Real number (ℝ)
High correlation Skewed
| Distinct | 595 |
|---|---|
| Distinct (%) | 64.4% |
| Missing | 8 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1511.8626 |
| Minimum | 775 |
|---|---|
| Maximum | 24300 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 775 |
|---|---|
| 5-th percentile | 1079.05 |
| Q1 | 1318 |
| median | 1480.5 |
| Q3 | 1655 |
| 95-th percentile | 1908.85 |
| Maximum | 24300 |
| Range | 23525 |
| Interquartile range (IQR) | 337 |
Descriptive statistics
| Standard deviation | 790.96966 |
|---|---|
| Coefficient of variation (CV) | 0.52317564 |
| Kurtosis | 748.32524 |
| Mean | 1511.8626 |
| Median Absolute Deviation (MAD) | 167.5 |
| Skewness | 25.956865 |
| Sum | 1396961 |
| Variance | 625633 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1439 | 5 | 0.5% |
| 1514 | 5 | 0.5% |
| 1458 | 5 | 0.5% |
| 1539 | 5 | 0.5% |
| 1440 | 5 | 0.5% |
| 1513 | 5 | 0.5% |
| 1250 | 4 | 0.4% |
| 1462 | 4 | 0.4% |
| 1174 | 4 | 0.4% |
| 1609 | 4 | 0.4% |
| Other values (585) | 878 | |
| (Missing) | 8 | 0.9% |
| Value | Count | Frequency (%) |
| 775 | 1 | |
| 780 | 1 | |
| 854 | 1 | |
| 869 | 1 | |
| 891 | 1 | |
| 896 | 1 | |
| 902 | 2 | |
| 913 | 1 | |
| 919 | 1 | |
| 932 | 2 |
| Value | Count | Frequency (%) |
| 24300 | 1 | |
| 2229 | 1 | |
| 2215 | 1 | |
| 2214 | 1 | |
| 2169 | 1 | |
| 2067 | 1 | |
| 2063 | 1 | |
| 2049 | 1 | |
| 2044 | 2 | |
| 2026 | 1 |
Builtup_area
Real number (ℝ)
High correlation Missing
| Distinct | 626 |
|---|---|
| Distinct (%) | 68.3% |
| Missing | 15 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1794.9248 |
| Minimum | 932 |
|---|---|
| Maximum | 12730 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 932 |
|---|---|
| 5-th percentile | 1298.4 |
| Q1 | 1583 |
| median | 1774 |
| Q3 | 1982 |
| 95-th percentile | 2280.8 |
| Maximum | 12730 |
| Range | 11798 |
| Interquartile range (IQR) | 399 |
Descriptive statistics
| Standard deviation | 468.15946 |
|---|---|
| Coefficient of variation (CV) | 0.260824 |
| Kurtosis | 324.52653 |
| Mean | 1794.9248 |
| Median Absolute Deviation (MAD) | 200 |
| Skewness | 13.919406 |
| Sum | 1645946 |
| Variance | 219173.28 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1858 | 5 | 0.5% |
| 1648 | 4 | 0.4% |
| 1869 | 4 | 0.4% |
| 1820 | 4 | 0.4% |
| 1746 | 4 | 0.4% |
| 2065 | 4 | 0.4% |
| 1733 | 4 | 0.4% |
| 1734 | 4 | 0.4% |
| 1943 | 4 | 0.4% |
| 2262 | 4 | 0.4% |
| Other values (616) | 876 | |
| (Missing) | 15 | 1.6% |
| Value | Count | Frequency (%) |
| 932 | 1 | |
| 951 | 1 | |
| 1018 | 1 | |
| 1050 | 1 | |
| 1059 | 1 | |
| 1073 | 1 | |
| 1087 | 1 | |
| 1093 | 1 | |
| 1099 | 1 | |
| 1106 | 1 |
| Value | Count | Frequency (%) |
| 12730 | 1 | |
| 2667 | 1 | |
| 2647 | 1 | |
| 2617 | 1 | |
| 2493 | 1 | |
| 2492 | 1 | |
| 2474 | 1 | |
| 2465 | 1 | |
| 2436 | 1 | |
| 2420 | 1 |
Parking_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 KiB |
| Open | |
|---|---|
| Not Provided | |
| Covered | |
| No Parking |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 7.4871245 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Open |
|---|---|
| 2nd row | Not Provided |
| 3rd row | Not Provided |
| 4th row | Covered |
| 5th row | Not Provided |
Common Values
| Value | Count | Frequency (%) |
| Open | 372 | |
| Not Provided | 227 | |
| Covered | 188 | |
| No Parking | 145 | 15.6% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| open | 372 | |
| not | 227 | |
| provided | 227 | |
| covered | 188 | |
| no | 145 | 11.1% |
| parking | 145 | 11.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 975 | |
| o | 787 | |
| d | 642 | |
| r | 560 | 8.0% |
| n | 517 | 7.4% |
| v | 415 | 5.9% |
| O | 372 | 5.3% |
| N | 372 | 5.3% |
| p | 372 | 5.3% |
| 372 | 5.3% | |
| Other values (7) | 1594 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6978 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 975 | |
| o | 787 | |
| d | 642 | |
| r | 560 | 8.0% |
| n | 517 | 7.4% |
| v | 415 | 5.9% |
| O | 372 | 5.3% |
| N | 372 | 5.3% |
| p | 372 | 5.3% |
| 372 | 5.3% | |
| Other values (7) | 1594 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6978 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 975 | |
| o | 787 | |
| d | 642 | |
| r | 560 | 8.0% |
| n | 517 | 7.4% |
| v | 415 | 5.9% |
| O | 372 | 5.3% |
| N | 372 | 5.3% |
| p | 372 | 5.3% |
| 372 | 5.3% | |
| Other values (7) | 1594 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6978 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 975 | |
| o | 787 | |
| d | 642 | |
| r | 560 | 8.0% |
| n | 517 | 7.4% |
| v | 415 | 5.9% |
| O | 372 | 5.3% |
| N | 372 | 5.3% |
| p | 372 | 5.3% |
| 372 | 5.3% | |
| Other values (7) | 1594 |
City_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 KiB |
| CAT B | |
|---|---|
| CAT A | |
| CAT C |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CAT B |
|---|---|
| 2nd row | CAT B |
| 3rd row | CAT A |
| 4th row | CAT B |
| 5th row | CAT B |
Common Values
| Value | Count | Frequency (%) |
| CAT B | 365 | |
| CAT A | 329 | |
| CAT C | 238 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| cat | 932 | |
| b | 365 | 19.6% |
| a | 329 | 17.7% |
| c | 238 | 12.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1261 | |
| C | 1170 | |
| T | 932 | |
| 932 | ||
| B | 365 | 7.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4660 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 1261 | |
| C | 1170 | |
| T | 932 | |
| 932 | ||
| B | 365 | 7.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4660 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 1261 | |
| C | 1170 | |
| T | 932 | |
| 932 | ||
| B | 365 | 7.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4660 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 1261 | |
| C | 1170 | |
| T | 932 | |
| 932 | ||
| B | 365 | 7.8% |
Rainfall
Real number (ℝ)
| Distinct | 131 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 785.5794 |
| Minimum | -110 |
|---|---|
| Maximum | 1560 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 1 |
| Negative (%) | 0.1% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | -110 |
|---|---|
| 5-th percentile | 360 |
| Q1 | 600 |
| median | 780 |
| Q3 | 970 |
| 95-th percentile | 1220 |
| Maximum | 1560 |
| Range | 1670 |
| Interquartile range (IQR) | 370 |
Descriptive statistics
| Standard deviation | 265.54685 |
|---|---|
| Coefficient of variation (CV) | 0.33802675 |
| Kurtosis | -0.17902532 |
| Mean | 785.5794 |
| Median Absolute Deviation (MAD) | 180 |
| Skewness | 0.047163242 |
| Sum | 732160 |
| Variance | 70515.131 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 670 | 19 | 2.0% |
| 790 | 19 | 2.0% |
| 760 | 18 | 1.9% |
| 680 | 17 | 1.8% |
| 770 | 16 | 1.7% |
| 660 | 16 | 1.7% |
| 730 | 16 | 1.7% |
| 700 | 15 | 1.6% |
| 860 | 15 | 1.6% |
| 900 | 15 | 1.6% |
| Other values (121) | 766 |
| Value | Count | Frequency (%) |
| -110 | 1 | 0.1% |
| 0 | 1 | 0.1% |
| 70 | 1 | 0.1% |
| 100 | 1 | 0.1% |
| 120 | 1 | 0.1% |
| 130 | 1 | 0.1% |
| 140 | 1 | 0.1% |
| 160 | 1 | 0.1% |
| 190 | 1 | 0.1% |
| 210 | 3 |
| Value | Count | Frequency (%) |
| 1560 | 1 | |
| 1530 | 1 | |
| 1490 | 1 | |
| 1470 | 1 | |
| 1450 | 1 | |
| 1440 | 2 | |
| 1410 | 2 | |
| 1400 | 1 | |
| 1390 | 1 | |
| 1380 | 1 |
Price_house
Real number (ℝ)
Skewed
| Distinct | 849 |
|---|---|
| Distinct (%) | 91.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6084695.3 |
| Minimum | 30000 |
|---|---|
| Maximum | 1.5 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 30000 |
|---|---|
| 5-th percentile | 3307050 |
| Q1 | 4658000 |
| median | 5866000 |
| Q3 | 7187250 |
| 95-th percentile | 8790800 |
| Maximum | 1.5 × 108 |
| Range | 1.4997 × 108 |
| Interquartile range (IQR) | 2529250 |
Descriptive statistics
| Standard deviation | 5025363.9 |
|---|---|
| Coefficient of variation (CV) | 0.82590231 |
| Kurtosis | 724.14922 |
| Mean | 6084695.3 |
| Median Absolute Deviation (MAD) | 1261000 |
| Skewness | 25.274671 |
| Sum | 5.670936 × 109 |
| Variance | 2.5254282 × 1013 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 6354000 | 3 | 0.3% |
| 5459000 | 3 | 0.3% |
| 5644000 | 2 | 0.2% |
| 7813000 | 2 | 0.2% |
| 6228000 | 2 | 0.2% |
| 6366000 | 2 | 0.2% |
| 7653000 | 2 | 0.2% |
| 6218000 | 2 | 0.2% |
| 7809000 | 2 | 0.2% |
| 4605000 | 2 | 0.2% |
| Other values (839) | 910 |
| Value | Count | Frequency (%) |
| 30000 | 1 | |
| 1492000 | 1 | |
| 1637000 | 1 | |
| 1767000 | 1 | |
| 2027000 | 1 | |
| 2070000 | 1 | |
| 2130000 | 1 | |
| 2147000 | 1 | |
| 2165000 | 1 | |
| 2175000 | 1 |
| Value | Count | Frequency (%) |
| 150000000 | 1 | |
| 11632000 | 1 | |
| 10515000 | 1 | |
| 10292000 | 1 | |
| 10231000 | 1 | |
| 10182000 | 1 | |
| 10178000 | 1 | |
| 10112000 | 1 | |
| 10090000 | 1 | |
| 9957000 | 1 |
Interactions
Correlations
| Builtup_area | Carpet_area | City_type | Hospital_dist | Market_dist | Parking_type | Price_house | Rainfall | Taxi_dist | |
|---|---|---|---|---|---|---|---|---|---|
| Builtup_area | 1.000 | 0.999 | 0.000 | 0.009 | -0.015 | 0.000 | 0.089 | -0.035 | 0.009 |
| Carpet_area | 0.999 | 1.000 | 0.000 | 0.011 | -0.014 | 0.033 | 0.095 | -0.039 | 0.013 |
| City_type | 0.000 | 0.000 | 1.000 | 0.000 | 0.056 | 0.000 | 0.000 | 0.000 | 0.000 |
| Hospital_dist | 0.009 | 0.011 | 0.000 | 1.000 | 0.588 | 0.067 | 0.138 | 0.049 | 0.782 |
| Market_dist | -0.015 | -0.014 | 0.056 | 0.588 | 1.000 | 0.030 | 0.123 | 0.063 | 0.418 |
| Parking_type | 0.000 | 0.033 | 0.000 | 0.067 | 0.030 | 1.000 | 0.032 | 0.058 | 0.079 |
| Price_house | 0.089 | 0.095 | 0.000 | 0.138 | 0.123 | 0.032 | 1.000 | 0.021 | 0.114 |
| Rainfall | -0.035 | -0.039 | 0.000 | 0.049 | 0.063 | 0.058 | 0.021 | 1.000 | 0.008 |
| Taxi_dist | 0.009 | 0.013 | 0.000 | 0.782 | 0.418 | 0.079 | 0.114 | 0.008 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
Sample
| Taxi_dist | Market_dist | Hospital_dist | Carpet_area | Builtup_area | Parking_type | City_type | Rainfall | Price_house | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 9796.0 | 5250.0 | 10703.0 | 1659.0 | 1961.0 | Open | CAT B | 530 | 6649000 |
| 1 | 8294.0 | 8186.0 | 12694.0 | 1461.0 | 1752.0 | Not Provided | CAT B | 210 | 3982000 |
| 2 | 11001.0 | 14399.0 | 16991.0 | 1340.0 | 1609.0 | Not Provided | CAT A | 720 | 5401000 |
| 3 | 8301.0 | 11188.0 | 12289.0 | 1451.0 | 1748.0 | Covered | CAT B | 620 | 5373000 |
| 4 | 10510.0 | 12629.0 | 13921.0 | 1770.0 | 2111.0 | Not Provided | CAT B | 450 | 4662000 |
| 5 | 6665.0 | 5142.0 | 9972.0 | 1442.0 | 1733.0 | Open | CAT B | 760 | 4526000 |
| 6 | 13153.0 | 11869.0 | 17811.0 | 1542.0 | 1858.0 | No Parking | CAT A | 1030 | 7224000 |
| 7 | 5882.0 | 9948.0 | 13315.0 | 1261.0 | 1507.0 | Open | CAT C | 1020 | 3772000 |
| 8 | 7495.0 | 11589.0 | 13370.0 | 1090.0 | 1321.0 | Not Provided | CAT B | 680 | 4631000 |
| 9 | 8233.0 | 7067.0 | 11400.0 | 1030.0 | 1235.0 | Open | CAT C | 1130 | 4415000 |
| Taxi_dist | Market_dist | Hospital_dist | Carpet_area | Builtup_area | Parking_type | City_type | Rainfall | Price_house | |
|---|---|---|---|---|---|---|---|---|---|
| 922 | 9538.0 | 11551.0 | 12839.0 | 1655.0 | 1986.0 | Covered | CAT B | 1150 | 7743000 |
| 923 | 11786.0 | 13969.0 | 15519.0 | 1156.0 | 1398.0 | Open | CAT A | 140 | 9237000 |
| 924 | 9615.0 | 7904.0 | 12521.0 | 1451.0 | 1734.0 | Open | CAT C | 670 | 3488000 |
| 925 | 7176.0 | 5779.0 | 12382.0 | 1539.0 | 1829.0 | Open | CAT B | 650 | 4658000 |
| 926 | 10915.0 | 17486.0 | 15964.0 | 1549.0 | 1851.0 | Not Provided | CAT C | 1220 | 7062000 |
| 927 | 12176.0 | 8518.0 | 15673.0 | 1582.0 | 1910.0 | Covered | CAT C | 1080 | 6639000 |
| 928 | 7214.0 | 8717.0 | 10553.0 | 1387.0 | 1663.0 | Open | CAT A | 850 | 8208000 |
| 929 | 7423.0 | 11708.0 | 13220.0 | 1200.0 | 1436.0 | Open | CAT A | 1060 | 7644000 |
| 930 | 15082.0 | 14700.0 | 19617.0 | 1299.0 | 1560.0 | Open | CAT B | 770 | 9661000 |
| 931 | 9297.0 | 12537.0 | 14418.0 | 1174.0 | 1429.0 | Covered | CAT C | 1110 | 5434000 |